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Figure 4 

A 

FIX 17683 4 -GTCTGCAACATOCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

RACE_9 5_3 GTCTGCAACATOCGGCTGTCTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 
RACE_9 5_8 GTCTGCAACATOCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 
RACE_9 5_1 1 GTCTGCAACATOCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 



FIX GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAA- - CGATT ATTACCGAGT ACCGC ATT ACT 

RACE_95_3 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAA- - CGATTATTACCGAGT ACCGC ATT ACT 

RACE_9 5_8 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAA- -CGATTATTACCGAGTACCGCATTACT 

RACE_9 5_1 1 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAAAACGATTATTACCGAGTACCGCATTACT 



FIX GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_95_3 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_95_8 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 

RACE_9 5_1 1 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCG 



FIX TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_95_3 TGGAC CTCACGTTG AACTAC C ACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_9 5_8 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 

RACE_9 5_1 1 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 



fix tcaagagHgagggtacgcgctaaaggtgtatgacaacgggaaggtaagggcgaacgggt 

race_95_3 tcaagag 

RACE_95_8 TCAAGAGgGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGT 

RACE_95_11 TCAAGAG— 



fix aacgggtaggtaaccgcatggggtgtgaaatgacgttcggaacctgtgcttgcHaatca 

RACE_95_3 Z-J^ CK 

RACE_95_8 AACGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGC|aAATCA 

RACE_95_11 AATCA 



Fix ACGTGACCGAGGTGTCGTTGCTC ATCAGCG ACTTTAGACGTC AG AACCGTCGCGGCGGC A 

RACE_95_3 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTATACGTCAGAACCGTCGCGGCGGCA 

RACE_95_8 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 

RACE_9 5_1 1 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 



FIX CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_95_3 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_95_8 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

RACE_9 5_1 1 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 



FIX AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA- 176346 

RACE_9 5_3 AGTTCAGCGTGCGGCTCTTTGCCAACTAOCCTGCGTCA 

RACE_9 5_8 AGTTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCA 

RACE_9 5„1 1 AGTTCAGCGTGCGGCTCTTTGCCAACTAOCCTGCGTCA 



8 

FIX 17 563 1 - CCGCGCGTC ATOAGTCCC AAAAACCTG ACGCCGTTCTTG ACGGCGTTGTGGCTGCTATTG 

RACE_9 5_3 CCGTGCGTCATQAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 
RACE_9S_8 CCGCGCGTCATQAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 
RACE_9 5_1 3 CCGCGCGTCATOAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTG 



FIX7 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAAC 

RACE_95_3 GGTC ACAGC CGC GTGCCGCGGGTACGCGC AGAAGAATGTTGCGAATTC ATAAACGTC AAC 

RACE_9 5_8 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAAC 

RACE_9 5_1 1 GGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAAC 



fix cacccgccggaacgctgttacgatttcaaaatgtgcaatcgcttcaccgtcgcSacgta 

race_9 5_3 c^cccgccggaacgctgttacgatttcaaaatgtgcaatcgcttcaccgtcgcg!acgta 

race_9 5_8 cacccgccggaacgctgttacgatttcaaaatgtgcaatcgcttcaccgtcgciiacgta 

race_9 5_1 1 cacccgccggaacgctgttacgatttcaaaatgtgcaatcgcttc accgtcgc 
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FIX TTTTTATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGATAG 

RACE_95_3 TTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAG 

RACE_95_8 TTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAG 

RACE_95_11 

FIX CCATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCAC^GCTG 

RACE„95_3 CCATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCAC^GCTG 

RACE_9 5_8 CCATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCAC|1GCTG 

RACE_95_11 GCTG 

FIX CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

RACE_9 5_3 CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

RACE_9 5_8 CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

RACE_9 5_1 1 CGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATC 

FIX GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

RACE_95_3 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAACTGC 

RACE_95_8 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

RACE_9 5_1 1 GTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC 

FIX AACTACAATCCgAAGTCTCTTCCTCGAGGGCCTTACAGCCTATGGGAAAGTAAGACAGA 

RACE_9 5_3 AACTACAATCC 

RACE_9 5_8 AACTACAATCC 

RACE_95_11 AACTACAATCT . 

FIX GGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCCCCTC 

RACE_95_3 

RACE_95_8 

RACE_95_11 

FIX CGTGTTGT^GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_9 5_3 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_9 5_8 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

RACE_9 5_11 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACA 

FIX AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_95_3 AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_95_8 AGGCGCAGTACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATCAACCTGGAAT 

RACE_9 5_1 1 AGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAAT 

FIX ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA 

RACE_9 5_3 ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA 

RACE_95_8 ACGACAAGATAGCCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACA 

RACE_9 5_1 1 ACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAATACA 

FIX AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA- 174887 

RACE_9 5_3 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTOAATAATAAA 

RACE_9 5_8 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

RACE_9 5_1 1 AACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

C 



FIX 174892 - CGCTAAAATGGGCT ATATGCTGC AGTGAAT^^^gATGTGTGTTTGTCCG A- 17 4841 

RACE_95_3 ... CGCTAAAATGGGCTATATGCTGCAGTGAAT^^HaTGTGTGTTTGTCCGCAAAAAAAA ... 

RACE_95_8 ... cgctaaaatgggctatatgctgcagtgaat^^^Satgtgtgtttgtccaaaaaaaaaa ... 
race_95_ii ... cgctaaaatgggctatatgctgcagtqaat^^^Satgtgtgtttgtccaaaaaaaaaa ... 



Fig. 4 UL131-128 mRNA processing - Panels (A-C) compare FIX-BAC DNA 
sequence (numbered according to Chee et al.) to a set of cDNA sequences from 
RACE clones 95-3, 95-8 and 95-11 (A) UL131 region, (B) UL128 region, (C) UL131- 
128 transcripts 3' end. Start codons, stop codons and the polyA site are in bold face, 
mRNA processing signals (splice donor sequence, splice acceptor sequence, 
AATAAA signal) are grey-shaded. 
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Figure 5 




Fig. 5 Exon-intron organization of the FIX-BAC UL131-128 genetic locus. UL131 
(green); UL130 (orange); UL128 (blue); UL128x1 C-terminus (light blue). 
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Figure 6 
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Translation of SEQKlon95-3.txt 



HCK-1 (pUL131) 



21 

121 
41 

181 

61 

241 



301 
101 



361 
121 



ATGCQGCT0TCTCGGOTOT0OC! 



Q R E T A E K 



X3TCTGTGCGCCGTGGTGCTOGQTCAGTGC 
CLCAVVLGQC 



TTATTACCGAGTACCGCATTACTGaOACOCOTGC 
DYYRV P H Y W D A C 



~ AA AC C CGTT A C AAGT ATGTGG AAC AGCTCGTGG AC CTC ACG 
QTRYKYVEQLVDLT 

TIG AACTACC ACTACG ATGCG AGCCACGGCTTGG ACAACTTTO ACGTG CTC AAG AGAATC 
LNYHYDASHG LDNFDVLKR I 



IGCTC^CGCCTCACGCCCGGAGCCTC 



S R A L P D 




pUL131 



EFSVRLFAN 



1128x2 128x1 T 





131x2 




131x1 



Translation of SEQKlon95-8.txt 



HCK-2(pUL131x1) 



pUL131x1 



LCRVWLSV 



G CCGTGGTGCTGGG TC AGTGC 
A V V L G Q C 



61 CAGCGGG AG ACCGC AGAAAAAA ACG ATTATTACCG AGTACCG C ATTACTGGG ACGCGTGC 

21 QRETAEKNDYYRVPHYWDAC 

121 TCTCGCGCG CTGCCTGACXrAAACCCGTTACAAGTATGTGGAACAGCTCGTGGACCTCACG 

41 SRALPDQTRYKYVEQLVDLT 

181 TTG AACT ACC ACTACG ATG CG AGCCACGGCTTGGAC AACTTTGACGTGCTC AAG AGGTGA 

61 LNYHYDASHG LDNFDVLKR * 



Fig. 6 Scheme of the differentially spliced transcripts of the UL131-128 region. Upper 
panel RACE clone 95-3 and predicted open reading frame (orf) pUL131 (HCK-1). 
Lower panel RACE clone 95-8 and predicted orf UL131x1 (HCK-2). 
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Figure 7 




FIX-BAC 



Translation of SEQ128 . txt ( 3,563 ; HCK-4 (pUL128) 



1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTA'EEGTGACAGC 

1 MSPKNLTPPLTALWLLLGHS 

6 1 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
21 RVPRVRAE ECCEFINVNHPP 



121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCACTGCGGTGTCCGGAC 
41 ERCYDFKMCNRFTVALRCPD 



128x2 




128x2 




128x1 







181 GGCGAAGTCTGCTACAGTCCCGAGAAACGGCTGAGATTCGCGGGATCGTCACCACCATG 
61 GEVCYS PEKTAEIRG IVTTM 

241 ACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAATCTG 
81 THS LTRQVVHNKLTSCNYNL 

301 TTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGTAC 
101 LYLEADGR IRCGKVNDKAQY 

361 CTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAATACGACAAGATA 
121 LLGAAGSVPYRWINLEYDKI 

42 1 ACCCGGATGGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGGAT 
141 TRIVGLDQYLESVKKHKRLD 

481 GTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGA 
161 VCRAKMGYMLQ* 




pUL128 



Translation of SEQ128 x ltxt HCK"3 (pUL128x1) 



61 
21 



121 
41 



ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 
MSPKNLTPFLTALWLLLGHS 

CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
RVPRVRAEECCEFINVNHPP 

GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTGA 
ERCYDFKMC NRFTVAYVFS * 



128x2 




128x2 












128x1 





pUL128x1 



Fig. 7 Scheme of the differentially spliced transcripts of the UL131-128 region. Upper 
panel SEQUL128B and predicted open reading frame (orf) pUL128 (HCK-4). Lower 
panel SEQUL128A and predicted orf UL128x1 (HCK-3). 



ROTHWELL, FIGG, ERNST & MANBECK 
Application Serial No.: New Application 
By: Gabriele HAHN 
Attorney: Robert B. Murray 
Attorney Docket No.: 2923-0545 
(Figures 1-3 in specification) 
6 of 20 



Figure 8 



Northern Blot Analys s 

RVFIX, RVFIX mutants and laboratory strains: 



& <y <y $ 

^ V V V V <S? 




Fig. 8 mRNA was prepared from RVFIX-infected fibroblasts 4 days p.i. using 
Rneasy Mini, QIAshredder and Oligotex mRNA Mini kits according to the 
manufacturer's guidance (Qiagen). For Nothern blotting, 1 fjg RNA was 
electrophoresed on an agarose gel according to the MOPS-formaldehyde protocol 
and blotted onto Hybond N+ membranes (Amersham Pharmacia). Blots were 
hybridized with a UL131-128 specific probe. 
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Figure 9 

Comparison RACE clone 95-3 - FIX genomic sequence 



Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-3.txt, from 1 to 1741 



SEQFIX UL131-128.txtiSEQKlon95-3.txt identity= 99 . 66% ( 1735/1741) 
gap=ll. 94% (236/1977) 



1 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

Illlllllll IIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIMI 

1 ATGCGGCTGTCTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

61 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 II 

52 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGG 
121 GACGCGTGCTCTCGCGCGCTGCCTGACC AAACCCGTTAC AAGTATGTGGAAC AGCTCGTG 

1 1 1 1 1 1 1 1 1 L 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

112 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTAC AAGTATGTGGAAC AGCTCGTG 
181 GACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGCTC 

M II II I II II Ml 1 1 1 1 1 1 1 1 1 II 1 1 1 II M I Ml I Ml Mil I II I II I II 1 1 1 1 1 1 

172 GACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGCTC 
241 AAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGTAA 

him 

232 AAGAG 

301 CGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCAAC 

II Ml II 

235 AATCAAC 

361 GTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCACC 

MIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII IIIIIIIMIIIIIMIMIIIIII 

244 GTGACCGAGGTGTCGTTGCTC ATCAGCGACTTTATACGTCAGAACCGTCGCGGCGGC ACC 
421 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 

I M M M II I M 1 1 II 1 1 1 1 1 1 III I II 1 1 II II II II Ml II II I II I Ml MIMMI 

304 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 
481 TTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCACGGGAAATAATATGCTACGGCTTC 

1111 1 1 1 M 1 1 1 1 1 M 1 1 1 1 I [ 11 1 M 1 1 1 1 1 M 1 1 ! II I ! M I 

364 TTC AGCGTGCGGCTCTTTGCC AACTAGCCTGCGTC ACGGGAAATAATATGCTACGGCTTC 
541 TGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

424 TGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 
601 CGTCTCCGTGGTTCACGCTAACGGCGAACC AGAATCCGTCCCCGCC ATGGTCTAAACTGA 

II I II II I II II 1 1 II I II I II III II II I II MMIM II Mill III II III III II I 

484 CGTCTCCGTGGTTC ACGCTAACGGCGAACCAGAATCCGTCCCCGCCATGGTCTAAACTGA 
661 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 

1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

544 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 
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721 CACGGTCCCCCTCGC AATTCCCGGGGTTCCAGCGGGTATC AACGGGTCCCGAGTGTCGC A 

I I I I 1 I I I I I I II I I I I I II I IIIIIIIIMMIIIIII I II I II I I I I I M II I I I I 
604 CACGGTCCCCCTCGC AATTCCCGGGGTTCC AGCGGGTATCAACGGGTCCCGAGTGTCGCA 

781 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 

I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 III 1 1 1 III 1 1 1 II I M Ml 1 1 II 1 1 1 1 1 1 1 1 II 

664 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 
841 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGC AATC AGACC ATCCTCC AACGGA 

1 1 M M I 1 1 1 M 1 1 1 1 1 1 1 1 M I I III 1 1 MM 1 1 M 1 1 1 1 II 1 1 1 1 1 1 M M 1 1 1 1 

724 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGC AATCAGACCATCCTCCAACGGA 
901 TGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGCGTGGAAGACGCCA 

1 1 II II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M I I II 1 1 II III I II I II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 

784 TGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGC AGATCAGCGTGGAAGACGCCA 
961 AGATTTTTGGAGCGCACATGGTGCCCAAGCAGACCAAGCTGCTACGTTTCGTCGTCAACG 

II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II I II II II I II I II I II 1 1 II II II 1 1 1 1 1 Mill 

844 AGATTTTTGGAGCGCAC ATGGTGCCCAAGC AGACCAAGCTGCTACGTTTCGTCGCC AACG 
1021 ATGGCAC ACGTTATCAGATGTGTGTGATGAAACTGGAGAGCTGGGCCCACGTCTTCCGGG 

llllllllllllllllll MM llllllllllllllllllllllllllllllllllllll 

904 ATGGCAC ACGTTATCAGATGTGTGTGATGAAACTGGAGAGCTGGGCCCACGTCTTCCGGG 
1081 ACTACAGCGTGTCTTTTC AGGTGCGATTGACGTTC ACCGAGGCCAATAACC AGACTT ACA 

II II II 1 1 II I II Mill I II I II II II IIMIMI II MM Mill Ml 1 1 1 1 II II M 

964 ACT ACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACC AGACTT ACA 
1141 CCTTCTGC ACCC ATCCC AATCTC ATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTGAA 

II II M II I II M II M I MM 1 1 II III llllll II I II I Mill 1 1 1 1 1 II I MM II 

1024 CCTTCTGC ACCCATCCC AATCTC ATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTGAA 
1201 AACCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 

IMM II M I II II II 1 1 1 II M I Ml II M III Ml I III II I II MM I Ml I II II 

1084 AACCGTGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 
1261 TGGGTCAC AGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTC A 

1 1 II 1 1 1 1 M 1 1 1 II II 1 1 1 1 1 II II II II II III I II I II 1 1 1 II I MM 1 1 II I II II 

1144 TGGGTCACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCA 
1321 ACC ACCCGCCGGAACGCTGTTACGATTTC AAAATGTGC AATCGCTTC ACCGTCGCGTACG 

Mill II 1 1 1 1 II II III 1 1 II II 1 1 1 1 II II MM I M II 1 1 M M 1 1 1 1 1 1 1 M II II 

1204 ACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACG 
13 81 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGAT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 

1264 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGAT 
1441 AGCC ATGTTCCATCGACGATCCTCGGG AATGCCAGAGTAGATTTTC ATGAATCC AC AGGC 

II M 1 1 1 II II II M II II 1 1 M M MM I II III Mill I II III III I II Ml M 1 1 1 

1324 AGCC ATGTTCCATCGACGATCCTCGGGAATGCCAGAGTAGATTTTC ATGAATCCAC AGGC 
1501 TGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

II M II I II 1 1 II II M 1 1 1 1 M 1 1 Mill II Ml MM II M MM III M MM M 1 1 

1384 TGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

1561 TCGTCACC ACC ATGACCC ATTCATTGACACGCC AGGTCGTAC ACAAC AAACTGACGAGCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I 
1444 TCGTCACC ACC ATGACCCATTC ATTGACACGCC AGGTCGTAC ACAAC AAACTGACGAACT 

1621 GCAACTACAATCCGTAAGTCTCTTCCTCGAGGGCCTTACAGCCTATGGGAAAGTAAGACA 
MIMMMMM 

1504 GC AACTAC AATCC 
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1681 GAGGGACAAAACATC ATTAAAAAAAAAGTCTAATTTC ACGTTTTGTACCCCCCCTTCCCC 

1517 

1741 TCCGTGTTGTAGGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 

1517 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGA 

1801 C AAGGCGCAGTACCTGCTGGGCGCCGCTGGC AGCGTTCCCTATCGATGGATC AACCTGGA 

1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; I! 1 1 1 1 1 II II I II II 1 1 1 1 1 1 

1565 CAAGGCGC AGTACCTGCTGGGCGCCGCTGGC AGCGTTCCCTATCGATGGATCAACCTGG A 

1861 ATACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACA 

I I i 1 1 I 1 I I I I 1 I I 1 I 1 1 I t t 1 1 1 I I I I 1 I I I 1 I I I I I I I I I 1 I 1 1 I I I t I 1 I I I I 1 I 1 I 
1625 ATACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACA 

1921 C AAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAA 

IMlllllll I Mill MINI MMIIII III MM MINIM M Mill MM! 

1685 CAAACGGCTGGATGTGTGCCGCGCT AAAATGGGCTATATGCTGC AGTGAATAATAAA 



Translation of SEQKIon95-3.txt: HCK-1 (pUL131) 



1 ATGCGGCTGTCTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTGGGTCAGTGC 

1 MRLSRVWLSVCLCAVVLGQC 

61 cagcgggagaccgcagaaaaaaacgattattaccgagtaccgcattactgggacgcgtgc 

21 qretaekIdyyrvphywdac 

121 tctcgcgcgctgcctgaccaaacccgttacaagtatgtggaacagctcgtggacctcacg 

41 SRAL PDQTRYKYVEQLVDLT 

181 TTGAACTACCACTACGATGCGAGCCACGGCTTGGAC AACTTTGACGTGCTC AAGAGAATC 

61 l|yhydashgld!fdvlkri 

241 aacgtgaccgaggtgtcgttgctcatcagcgactttatacgtcagaaccgtcgcggcggc 

81 |vtevsllisdfirq|rrgg 

301 accaacaaaaggaccacgttcaacgccgccggttcgctggcgcctcacgcccggagcctc 

ioi t|krttf|aagslapharsl 

361 gagttcagcgtgcggctctttgccaactag 

121 efsvrlfaJI* 
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Figure 10 

Comparison RACE clone 95-8 -FIX genomic sequence 

Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-8.txt, from 1 to 1849 



SEQFIX UL131-128.txt: SEQKlon95-8.txt identity= 99 . 78% ( 1845 /1849 ) 
gap=6.47%(128/1977) 



1 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

I II I II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II Mill 1 1 1 II II I II Ml II 1 1 1 II I 

1 ATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

61 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGG 

Ml 1 1 1 1 1 II II Ml III 1 1 1 1 1 1 1 II II I II II I MM I II III 1 1 II I M M 1 1 1 II I 

52 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGG 
121 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCGTG 

MIMIIIIIIIIIIIIII IIIIIIIMMIIIIIIIIIIIIIMIIIIIII lllllll I 

112 GACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTACAAGTATGTGGAACAGCTCGTG 
181 GACCTCACGTTGAACTACC ACTACGATGCG AGCC ACGGCTTGGACAACTTTGACGTGCTC 

II I II M I II M 1 1 1 II II 1 1 1 M I II 1 1 1 M II Mill I MM M I II I M II I II M I 

172 GACCTCACGTTGAACTACC ACT ACGATGCGAGCC ACGGCTTGGACAACTTTGACGTGCTC 
241 AAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGTAA 

II I II III M 1 1 M 1 1 1 M II MM 1 1 1 II 1 1 1 M MM I II II M 1 1 M II 1 1 1 II II I 

232 AAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGTAA 
301 CGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCAAC 

1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

292 CGGGTAGGTAACCGC ATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGC AGAATC AAC 
361 GTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCACC 

1 1 II III Mill M II 1 1 1 1 1 1 II III II II II MM II II IMIMM II II I II Ml 

352 GTGACCGAGGTGTCGTTGCTC ATC AGCGACTTTAGACGTC AGAACCGTCGCGGCGGC ACC 
421 AACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 

1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 M I i 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 

412 AAC AAAAGGACC ACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCGAG 
481 TTCAGCGTGCGGCTCTTTGCCAACTAGCCTGCGTCACGGGAAATAATATGCTACGGCTTC 

k 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

472 TTC AGCGTGCGGCTCTTTGCC AACTAGCCTGCGTC ACGGGAAATAATATGCTACGGCTTC 
541 TGCTTCGTCACC ACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGC AACGCCCTGTCTGG 

MINIMI MINI I III 1 1 MINIM IIIIIIIIIIIIIIMIIIIIIIIIIIIII 

532 TGCTTCGTCACCACTTTC ACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCTGG 
601 CGTCTCCGTGGTTCACGCTAACGGCGAACC AGAATCCGTCCCCGCCATGGTCTAAACTGA 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 E 1 1 1 II E 1 1 M I M 1 1 1 1 1 M 1 1 1 1 M 1 1 11 1 1 

592 CGTCTCCGTGGTTCACGCTAACGGCGAACC AGAATCCGTCCCCGCCATGGTCTAAACTGA 
661 CGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 E 1 1 1 1 1 1 1 1 

652 CGTATCCC AAACCGC ATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCCCC 
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721 CACGGTCCCCCTCGC AATTCCCGGGGTTCC AGCGGGTATCAACGGGTCCCG AGTGTCGCA 

M 1 1 1 M 1 1 1 1 1 1 1 1 M M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 11 1 1 1 1 1 1 1 1 1 II M 1 1 1 

712 CACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGTCCCGAGTGTCGCA 
781 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 k 1 1 1 1 1 1 1 

772 ACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTCCA 
841 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATC AGACC ATCCTCC AACGGA 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

832 CCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATCAGACCATCCTCCAACGGA 
901 TGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGCGTGGAAGACGCCA 

M I II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 

892 TGCCCCGAACGGCTTCGAAACCGAGCGACGG AAACGTGC AGATC AGCGTGGAAGACGCC A 
961 AGATTTTTGGAGCGC AC ATGGTGCCC AAGC AGACC AAGCTGCTACGTTTCGTCGTC AACG 

1 1 1 ! i 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

952 AGATTTTTGGAGCGC AC ATGGTGCCC AAGCAGACC AAGCTGCTACGTTTCGTCGTC AACG 
1021 ATGGC AC ACGTTATC AGATGTGTGTGATGAAACTGG AGAGCTGGGCCCACGTCTTCCGGG 

Ml I M I II II I II I 1 1 IM II II M II I III III II 1 1 III 1 1 1 1 1 II II MM I M I 

1012 ATGGC AC ACGTTATC AGATGTGTGTGATGAAACTGGAGAGCTGGGCCC ACGTCTTCCGGG 
1081 ACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACCAGACTTACA 

1 1 1 MM II III 1 1 1 1 II 1 1 1 II I M I III M III III II III I IMIIIIIIIIIIII 

1072 ACTAC AGCGTGTCTTTTCAGGTGCGATTGACGTTC ACCGAGGCCGATAACC AGACTTAC A 
1141 CCTTCTGC ACCC ATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGC AGGGAATTTTGAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 M 1 1 

1132 CCTTCTGC ACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGC AGGGAATTTTGAA 
1201 AACCGCGCGTCATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 

I II I II 1 1 1 1 1 III II I III 1 1 1 II III 1 1 III II Mill III 1 1 1 1 III 1 1 Ml Ml II 

1192 AACCGCGCGTC ATGAGTCCC AAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTAT 

1261 TGGGTC AC AGCCGCGTGCCGCGGGTACGCGC AGAAGAATGTTGCGAATTC AT AAACGTC A 

I I I I I I I I I I 1 1 I I I I I I I I I I 1 I I 1 I I 1 I I I I I I 1 I I I I 1 I 1 I I 1 I I I 1 I I I I I I I 1 I I 
1252 TGGGTC AC AGCCGCGTGCCGCGGGTACGCGC AGAAGAATGTTGCGAATTC AT AAACGTCA 

1321 ACC ACCCGCCGGAACGCTGTT ACGATTTCAAAATGTGCAATCGCTTC ACCGTCGCGT ACG 

M 1 1 1 1 1 1 1 II 1 1 1 1 1 M I II I II 1 1 1 1 1 II I II 1 1 1 M M 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 M 

1312 ACC ACCCGCCGGAACGCTGTTACGATTTCAAAATGTGC AATCGCTTC ACCGTCGCGTACG 
13 81 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGAT 

Ml 1 1 1 III II I II 1 1 1 M 1 1 II MM M II I III II II 1 1 II 1 1 1 1 1 II I II MM II 

1372 TATTTTCATGATTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGAT 
1441 AGCC ATGTTCC ATCGACGATCCTCGGGAATGCC AGAGTAGATTTTCATGAATCCAC AGGC 

I Mill III II I II II Ml IMIMI III II IMM Mill II Ml II II I II Ml II I 

1432 AGCC ATGTTCC ATCGACGATCCTCGGGAATGCC AG AGTAGATTTTC ATGAATCC ACAGGC 
1501 TGCGGTGTCCGGACGGCGAAGTCTGCTAC AGTCCCGAGAAAACGGCTGAGATTCGCGGGA 

1 1 1 1 ! 1 1 II 1 1 1 1 f 1 1 1 M 1 1 1 1 1 1 M I i 1 1 1 1 1 1 1 ! I i 1 1 1 ! 1 1 1 1 1 1 1 

1492 TGCGGTGTCCGGACGGCGAAGTCTGCTAC AGTCCCGAGAAAACGGCTGAGATTCGCGGGA 
1561 TCGTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCT 

Ml 1 1 1 M MM 1 1 M I II 1 1 M Mill 1 1 III I II II M 1 1 1 II II M I MM I II I 

1552 TCGTCACCACCATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCT 
1621 GC AACTAC AATCCGTAAGTCTCTTCCTCGAGGGCCTTAC AGCCTATGGGAAAGT AAGAC A 

I I i M 1 1 1 1 1 1 ! I 

1612 GCAACTACAATCC 
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1681 GAGGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCCCC 
1625 

1741 TCCGTGTTGTAGGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGA 

I I II I I I I I I I I I I I I I I I I II II I III I I II I I I I I I I I I I I I I I I I 
1625 GTT ATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAACGA 

1801 CAAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGA 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II MM 1 1 1 II 1 1 1 1 1 1 II 1 1 II 1 1 

1673 CAAGGCGCAGTACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATCAACCTGGA 
1861 ATACGACAAGATAACCCGGATCGTGGGCCTGGATC AGTACCTGGAGAGCGTTAAGAAACA 

MMMMMMI M M M M M M I M M I M M I M M M M M 1 1 1 M M M M M 

1733 ATACGAC AAGATAGCCCGGATCGTGGGCCTGGATC AGTACCTGGAGAGCGTTAAGAAACA 
1921 CAAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGC AGTGAATAAT AAA 

II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II I II II III Mill MM I M III MMIMMI 

1793 C AAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGC AGTGAATAATAAA 



Translation of SEQKIon95-8.txt: HCK-2 (pUL131x1) 



1 ATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTGGGTCAGTGC 

1 MRLCRVWLSVCLCAVVLGQC 

61 CAGCGGGAGACCGCAGAAAAAAACGATTATTACCGAGTACCGCATTACTGGGACGCGTGC 

21 qretaekIdyyrvphywdac 

121 tctcgcgcgctgcctgacc aaacccgttacaagtatgtggaac agctcgtggacctc acg 

41 sralpdqtrykyveqlvdlt 

181 ttgaactaccactacgatgcgagccacggcttggacaactttgacgtgctcaagaggtga 

61 'l|yhydashgld|fdvlkr * 
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Figure 1 1 

Comparison RACE clon 95-1 1 -FIX genomic sequenc 



Upper line: SEQFIX UL131-128.txt, from 10 to 1977 
Lower line: SEQKIon95-11.txt, from 1 to 1620 

SEQFIX UL131-128.txtiSEQKlon95-ll.txt identity= 99 . 57% (1611/1618) 
gap=18.24%(361/1979) 



1 GTCTGCAACATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

INI II Mill 1 1 1 1 II II I MM I Mill 1 1 MM I II MM 1 1 MM I 

1 ATGCGGCTGTGTCGGGTGTGGCTGTCTGTTTGTCTGTGCGCCGTGGTGCTG 

6 1 GGTCAGTGCCAGCGGGAGACCGCAG . . AAAAAAACGATTATTACCGAGTACCGCATTACT 

MMMIMMMMMMMMM M M M M M M M I M M M M M M M M M 

52 GGTCAGTGCCAGCGGGAGACCGCAGAAAAAAAAACGATTATTACCGAGTACCGCATTACT 
119 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTAC AAGTATGTGGAAC AGCTCG 

M MM M Mill II Mill M II II M Mill Mill IMIMM M MM M M III I 

112 GGGACGCGTGCTCTCGCGCGCTGCCTGACCAAACCCGTTAC AAGTATGTGGAACAGCTCG 
179 TGGACCTC ACGTTGAACTACC ACTACGATGCGAGCCACGGCTTGGAC AACTTTGACGTGC 

M IMM I Mill M Mill M II II M Mill MUM M MM I M IMM M MM I 

172 TGGACCTCACGTTGAACTACCACTACGATGCGAGCCACGGCTTGGACAACTTTGACGTGC 
239 TCAAGAGGTGAGGGTACGCGCTAAAGGTGTATGACAACGGGAAGGTAAGGGCGAACGGGT 

MIMM 

232 TCAAGAG 

299 AACGGGTAGGTAACCGCATGGGGTGTGAAATGACGTTCGGAACCTGTGCTTGCAGAATCA 

MMI 

237 AATCA 

359 ACGTGACCGAGGTGTCGTTGCTCATCAGCGACTTTAGACGTCAGAACCGTCGCGGCGGCA 

MMMMMMMMMMMMMMIMMMMMMMMMMMMMMMI 

244 ACGTGACCGAGGTGTCGTTGCTC ATCAGCGACTTTAGACGTC AGAACCGTCGCGGCGGC A 
419 CCAACAAAAGGACCACGTTCAACGCCGCCGGTTCGCTGGCGCCTCACGCCCGGAGCCTCG 

1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 II I Ml 1 1 1 II 1 1 1 II I II 1 1 1 1 1 1 Ml 1 1 II I II I 

304 CCAAC AAAAGGACC ACGTTC AACGCCGCCGGTTCGCTGGCGCCTC ACGCCCGGAGCCTCG 
479 AGTTCAGCGTGCGGCTCTTTGCC AACTAGCCTGCGTCACGGGAAATAATATGCTACGGCT 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 M 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

3 64 AGTTC AGCGTGCGGCTCTTTGCC AACTAGCCTGCGTCACGGGAAATAATATGCTACGGCT 
539 TCTGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

424 TCTGCTTCGTCACC ACTTTC ACTGCCTGCTTCTGTGCGCGGTTTGGGCAACGCCCTGTCT 
599 GGCGTCTCCGTGGTTC ACGCTAACGGCGAACC AGAATCCGTCCCCGCC ATGGTCTAAACT 

II MM 1 1 MM III Mill II MM II Mill I Mill 1 1 Mill II MM II Mill I 

484 GGCGTCTCCGTGGTTC ACGCTAACGGCGAACC AGAATCCGTCCCCGCC ATGGTCTAAACT 
659 GACGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCC 

I I I M 1 1 1 1 II I II 1 1 1 1 II II M M I Ml 1 1 1 II 1 1 1 1 1 1 1 1 M I II II 1 1 1 1 1 II 1 1 1 

544 GACGTATCCCAAACCGCATGACGCGGCGACGTTTTACTGTCCTTTTCTCTATCCCTCGCC 
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719 CCC ACGGTCCCCCTCGCAATTCCCGGGGTTCC AGCGGGTATCAACGGGTCCCGAGTGTCG 

cnA IMIMMIMIIIIMI IMIIMIIIIIIIIIIMIIII 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 

604 CCCACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATTAACGGGTCCCGAGTGTCG 
779 CAACGAGACCCTGTATCTGCTGTAC AACCGGGAAGGCC AGACCTTGGTGGAGAGAAGCTC 

c Ml MM I Mill II Ml MM III I II Mill II MM MM I II I Mil II 1 1 II I II 

664 C AACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTGGAGAGAAGCTC 
839 CACCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATCAGACCATCCTCCAACG 

MINIM II MINIM INI III I M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

724 CACCTGGGTGAAAAAGGTGATCTGGC ATCTGAGCGGTCGC AATCAGACC ATCCTCCAACG 
899 GATGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATCAGCGTGGAAGACGC 

_ Mill! I Ml I MM Ml III I III III Mill Ml Mill Mill I Ml III I MM II 

784 GATGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGCAGATC AGCGTGGAAGACGC 
959 CAAGATTTTTGGAGCGCACATGGTGCCCAAGCAGACCAAGCTGCTACGTTTCGTCGTCAA 

IMIIIIIIMIIMIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII III 

844 CAAGATTTTTGGAGCGCACATGGTGCCC AAGCAGACCAAGCTGCTACGTTTCGTCGCCAA 
1019 CGATGGCAC ACGTTATC AGATGTGTGTGATGAAACTGGAGAGCTGGGCCC ACGTCTTCCG 

an IIIIIIIIIIIIMII I MINI lllllllllllllll II MINIMI II MINIM 

904 CGATGGCAC ACGTTATTAGATGTGTGTGATGAAACTGGAGAGCTGGGCCCACGTCTTCCG 
1079 GGACTACAGCGTGTCTTTTC AGGTGCGATTGACGTTCACCGAGGCCAATAACCAGACTTA 

1 1 M II 1 1 1 1 II I II 1 1 1 II II II II II I II I II 1 1 II II I II II II 1 1 1 II 1 1 II II 1 1 

964 GGACTACAGCGTGTCTTTTCAGGTGCGATTGACGTTCACCGAGGCCAATAACCAGACTTA 
1139 CACCTTCTGCACCCATCCCAATCTCATCGTTTGAGCCCGTCGCGCGCGCAGGGAATTTTG 

1 1 1 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 1 1 1 j 1 1 1 r 1 1 r 1 1 E 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 

1024 CACCTTCTGC ACCC ATCCCAATCTC ATCGTTTGAGCCCGTCGCGCGCGC AGGGAATTTTG 
1199 AAAACCGCGCGTC ATG AGTCCC AAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCT 

1 1 1 1 J r 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F [ 1 1 1 1 1 1 J ! 1 1 1 

1084 AAAACCGCGCGTCATGAGTCCC AAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCT 
12 59 ATTGGGTCAC AGCCGCGTGCCGCGGGTACGCGC AGAAGAATGTTGCGAATTC ATAAACGT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 j 1 [[ I [ [ I ! 1 1 1 1 1 1 1 1 1 1 1 1 

1144 ATTGGGTC ACAGCCGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTC ATAAACGT 
1319 CAACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTC ACCGTCGCGTA 

IIMMIIIIIIIIIIMIIIIIlllimillllllllllllllllllllllllll 

1204 CAACCACCCGCCGGAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGC . . 
1379 CGTATTTTC ATGATTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTG 
1262 

1439 ATAGCC ATGTTCC ATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCC ACAG 



1262 



1499 GCTGCGGTGTCCGGACGGCGAAGTCTGCTAC AGTCCCGAGAAAACGGCTGAGATTCGCGG 

I M I M M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M I II 1 1 1 1 1 M 1 1 1 1 1 1 1 

12 62 ACTGCGGTGTCCGGACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGG 
1559 GATCGTC ACCACC ATGACCC ATTCATTGAC ACGCCAGGTCGTACAC AAC AAACTGACGAG 

II II M I II I II 1 1 1 1 II II I II 1 1 1 1 II I M I II II I II II II II I II M I II II II II 

1321 GATCGTC ACC ACCATG ACCCATTCATTGACACGCC AGGTCGTACAC AAC AAACTGACGAG 
1619 CTGCAACTAC AATCCGTAAGTCTCTTCCTCGAGGGCCTTAC AGCCTATGGGAAAGTAAGA 

N I III I IN 1 1 1 1 

1381 CTGCAACTACAATCT . . 
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1679 CAGAGGGACAAAACATCATTAAAAAAAAAGTCTAATTTCACGTTTTGTACCCCCCCTTCC 

1396 

1739 CCTCCGTGTTGTAGGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAAC 

1 1 ! I i I ! I L 1 1 1 1 1 1 1 1 [ ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 

1396 GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGC AAAGTGAAC 

1799 GACAAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTG 

IMMIMIMI MMMMMMMIMMMMMMMMMIIMMMMMII 

1442 GACAAGGCGCAGTACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTG 
1859 GAATACGACAAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAA 

I I M Mill 1 1 MM Mill III IMMIMMIMM II M II Mi M I II 1 1 II I 

1502 GAATACGAC AAGATAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAA 
1919 CAC AAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAAT AAT AAA 

1 1 II II II I II I M 1 1 1 II 1 1 1 II II I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

1562 TAC AAACGGCTGGATGTGTGCCGCGCTAAAATGGGCTATATGCTGC AGTGAAT AATAAA 
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Figure 12 



Comparison SEQ 128 B - FIX genomic sequence 



Upper line: FIX genomic sequence 
Lower line: SEQ 128 B 



5998 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

MINI I MINI lllllh Mill I! IIIMMIMIIIIIIMM | MM II MM: i 

1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

6058 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTC ATAAACGTCAACC ACCCGCCG 

C1 M 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 II II 1 1 1 1 1 1 II 1 1 M II 

61 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 

6118 GAACGCTGTTACGATTTC AAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTCATGA 

101 IIIIIMIMIIIIIimilllllllllllMlllllllllllliiniMIIIIIIM 

121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCGTACGTATTTTC ATGA 

6178 TTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAGCCATGTTCC 

1Q1 1 1 1 i 1 1 1 1 1 1 11 1 ] I [! 1 1 1 1 1 1 1 1 [ 1 1 1 1 1[ | [ 1 1 [ M 1 1 1 1 1 1 1 1 MINIM 

181 TTGTCTGCGTTCTGTGGTGCGTCTGGATCTGTCTCTCGACGTTTCTGATAGCCATGTTCC 

6238 ATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCACAGGCTGCGGTGTCCG 

MIUMMMMMI I II Mill II MINIMUM MINIMI II MINI II Ml 

241 ATCGACGATCCTCGGGAATGCCAGAGTAGATTTTCATGAATCCACAGGCTGCGGTGTCCG 

6298 GACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATCGTCACCACC 

, 1 1 M M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

301 GACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATCGTCACCACC 

6358 ATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAAT 

i f 1 1 1 1 1 1 1 1 1 1 1 1 1 m i 1 1 ii 1 1 1 i 1 1 1 1 1 1 1 1 [ 1 1 1 1 [ 1 1 1 1 f 1 1 1 m 1 1 1 r 1 1 1 1 1 1 

361 ATGACCCATTC ATTGACACGCC AGGTCGTACACAAC AAACTGACGAGCTGCAACTAC AAT 

6418 CCGTAAGTCTCTTCCTCGAGGGCCTTACAGCCTATGGGAAAGTAAGACAGAGGGAC AAAA 

421 CC 

6478 CATC ATTAAAAAAAAAGTCTAATTTC ACGTTTTGTACCCCCCCTTCCCCTCCGTGTTGTA 

423 

6538 GGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGT 

4 „ MINIMUM Mill I II III III I III II I III 1 1 MM I Ml MM MM 

423 . GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGT 

6598 ACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATC AACCTGGAATACGACAAGA 

r Mill II Mill 1 1 II II 1 1 Mill III III II I Mill II III 1 1 Mill 1 1 IIIIMI 

482 ACCTGCTGGGCGCCGCTGGCGGCGTTCCCTATCGATGGATC AACCTGGAATACGACAAGA 

6658 TAGCCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGG 

7 . 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 [ I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

542 TAGCCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGG 
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6718 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAAATGTGTGTTTGTCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 IN 

602 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAAATGTGTGTTTGTCC 
6778 GAAATACGCGTTTTGAGATTTCTG 

III I M 

662 AAAAAAAAAAAAAAAAAAAAAAAA 



Translation of SEQ128 B x 1.txt: HCK-3 (pUL128x1) 



1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

l mspk|ltpfltalwlllghs 

61 cgcgtgccgcgggtacgcgcagaagaatgttgcgaattcataaacgtcaaccacccgccg 

21 rvprvraeeccefiIvIhpp 

121 gaacgctgttacgatttcaaaatgtgcaatcgcttcaccgtcgcgtacgtattttcatga 

41 ercydfkmcIrftvayvfs* 
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Figure 13 

Comparison SEQ 128 A 

Upper line: FIX-BAC 
Lower line: SEQ128 A 



FIX genomic sequence 



5998 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

I I I II II I II I I I I I I I I I I I I I I I I I I I I II II I I I I I I I II I I II I I I I I I I I I I I I I 

1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 

6058 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 

1 1 1 1 IMIM 1 1 1 1 1 1 1 1 II MM I II IN II M I M M 1 1 1 1 II MM 1 1 M 1 1 1 1 1 1 1 

61 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
6118 GAACGCTGTTACGATTTC AAAATGTGC AATCGCTTC ACCGTCGCGTACGTATTTTTATGA 

IMMMMMMMM MMMMMMMMMMMIMM 

121 GAACGCTGTT ACGATTTCAAAATGTGCAATCGCTTC ACCGTCGC 

6178 TTGTCTGCGTTCTGTGGTGCGTCTGGATTTGTCTCTCGACGTTTCTGATAGCC ATGTTCC 

166 

6238 ATCGACGATCCTCGGGAATGCC AGAGTAGATTTTC ATGAATCC ACAGGCTGCGGTGTCCG 

I I I I I I I I I I I I I 

166 GCTGCGGTGTCCG 

6298 GACGGCGAAGTCTGCTAC AGTCCCGAGAAAACGGCTGAGATTCGCGGGATCGTC ACCACC 

I I Ml I II 1 1 1 1 M M 1 1 1 II II II 1 1 II I Mill 1 1 II I II I II I II I II I II 1 1 1 II I 

178 GACGGCGAAGTCTGCTACAGTCCCGAGAAAACGGCTGAGATTCGCGGGATCGTCACCACC 
6358 ATGACCCATTC ATTGACACGCC AGGTCGTAC ACAAC AAACTGACGAGCTGC AACTACAAT 

I II II I II MM I M 1 1 1 1 1 1 1 II III I II 1 1 1 II II I II I II M 1 1 M Ml I Ml M II 

238 ATGACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGCAACTACAAT 

6418 CCGTAAGTCTCTTCCTCGAGGGCCTTAC AGCCTATGGGAAAGTAAGACAGAGGGAC AAAA 

298 CC 

6478 CATCATTAAAAAAAAAGTCTAATTTC ACGTTTTGTACCCCCCCTTCCCCTCCGTGTTGTA 

300 

6538 GGTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGT 

MM I Ml 1 1 1 M 1 1 II II M 1 1 1 Ml 1 1 1 III 1 1 1 1 1 1 1 M II II I II 1 1 1 II I M M 

300 . GTTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGT 
6598 ACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAATACGACAAGA 

I II Mill M 1 1 M I M I II II 1 1 1 MM II I II I II I II I II I II 1 1 1 1 II I II 1 1 1 II 

359 ACCTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATC AACCTGGAATACGACAAGA 
6658 TAACCCGGATCGTGGGCCTGGATCAGTACCTGGAGAGCGTTAAGAAAC AC AAACGGCTGG 

MMMMMMMMMMMMMMMMIM MMMMMMMMMMMMI 

419 TAACCCGGATCGTGGGCCTGGATC AGTACCTGGAGAGCGTTAAGAAACACAAACGGCTGG 
6718 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAAATGTGTGTTTGTCC 

1 1 1 II I II I II II I M 1 1 1 1 II M MM II 1 1 II I II III I II 1 1 II 1 1 1 1 II II I M II 

479 ATGTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGAATAATAAAATGTG 
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Translation of SEQ128 A: HCK-4 (pUL128) 

1 ATGAGTCCCAAAAACCTGACGCCGTTCTTGACGGCGTTGTGGCTGCTATTGGGTCACAGC 
1 MSPK§LTPFLTALWLLLGHS 

61 CGCGTGCCGCGGGTACGCGCAGAAGAATGTTGCGAATTCATAAACGTCAACCACCCGCCG 
21 RVPRVRAEECCEFllvlHPP 

121 GAACGCTGTTACGATTTCAAAATGTGCAATCGCTTCACCGTCGCACTGCGGTGTCCGGAC 

41 ercydfkmcIrftvalrcpd 

181 QGCGAAGTCTGCTAC AGTCCCGAGAAACGGCTGAGATTCGCGGGATCGTCACCACCATG 

61 GEVCYSPEKTAEIRG IVTTM 

241 ACCCATTCATTGACACGCCAGGTCGTACACAACAAACTGACGAGCTGC^CTAC^TCTG 

81 thsltrqvvhIkltsciyIl 

301 TTATACCTCGAAGCTGACGGGCGAATACGCTGCGGCAAAGTGAACGACAAGGCGCAGTAC 

ioi lyleadgrircgkvIdkaqy 

361 CTGCTGGGCGCCGCTGGCAGCGTTCCCTATCGATGGATCAACCTGGAATACGACAAGATA 

121 llgaagsvpyrwiIleydki 

421 acccggatcgtgggcctggatcagtacctggagagcgttaagaaacacaaacggctggat 
141 trivgldqylesvkkhkrld 



481 
161 



GTGTGCCGCGCTAAAATGGGCTATATGCTGCAGTGA 
VCRAKMGYMLQ* 
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Figure 14 



Translation of SEQUL130: HCK-5 (pUL130) 



1 ATGCTACGGCTTCTGCTTCGTCACCACTTTCACTGCCTGCTTCTGTGCGCGGTTTGGGCA 

1 MLRLLLRHHFHCLLLCAVWA 

61 ACGCCCTGTCTGGCGTCTCCGTGGTTCACGCTAACGGCGAACCAGAATCCGTCCCCGCCA 

21 TPCLASPWFTLTANQNPSPP 

121 TGGTCTAAACTG ACGTATCCC AAACCGC ATGACGCGGCGACGTTTT ACTGTCCTTTTCTC 

41 WSKLTYPKPHDAATFYCPFL 

181 TATCCCTCGCCCCCACGGTCCCCCTCGCAATTCCCGGGGTTCCAGCGGGTATCAACGGGT 

61 YPSPPRSPSQFPGFQRVSTG 

241 CCCGAGTGTCGC AACGAGACCCTGTATCTGCTGTACAACCGGGAAGGCCAGACCTTGGTG 

81 PECRNETLYLLYNREGQTLV 

301 GAGAGAAGCTCC ACCTGGGTGAAAAAGGTGATCTGGTATCTGAGCGGTCGCAATC AGACC 

101 ERSSTWVKKVIWYLSGRNQT 

361 ATCCTCCAACGGATGCCCCGAACGGCTTCGAAACCGAGCGACGGAAACGTGC AGATC AGC 

121 ILQRMPRTASKPSDGNVQIS 

421 GTGGAAG ACGCC AAGATTTTTGG AGCGC AC ATGGTGCCC AAGC AGACC AAGCTGCTACGT 

141 VEDAKIFGAHMVPKQTKLLR 

481 TTCGTCGTC AACGATGGCACACGTTATC AGATGTGTGTGATGAAACTGGAGAGCTGGGCC 

161 FVVNDGTRYQMCVMKLESWA 

541 C ACGTCTTCCGGGACTAC AGCGTGTCTTTTC AGGTGCGATTGACGTTC ACCGAGGCC AAT 

181 HVFRDYSVSFQVRLTFTEAN 

601 AACCAGACTTAC ACCTTCTGCACCCATCCCAATCTCATCGTTTGA 

201 NQTYTFCTHPNLIV* 



